Accession Number |
TCMCG075C27583 |
gbkey |
CDS |
Protein Id |
XP_017982900.1 |
Location |
complement(join(32007138..32007142,32007358..32008570,32008645..32008725,32008806..32008910,32009720..32009881,32011097..32011204,32011441..32011514,32011788..32011863,32011954..32012041,32012196..32012264,32013200..32013330,32014288..32014450,32014708..32014780,32015458..32015500,32015824..32015904,32016539..32016562)) |
Gene |
LOC18590084 |
GeneID |
18590084 |
Organism |
Theobroma cacao |
|
|
Length |
831aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018127411.1 |
Definition |
PREDICTED: E3 SUMO-protein ligase SIZ1 isoform X2 [Theobroma cacao] |
CDS: ATGGATTTGGTGGCCAGTTGCAAGGATAAATTGGCGTATTTTCGAATAAAAGAACTCAAGGATGTTCTCACTCAATTGGGTCTTTCAAAGCAAGGGAAGAAGCAGGACCTTGTTGAGAGGATATTAGGTGCTCTCTCTGATGAACAAGTTGCAAAGATGTGGGCAAAAAGGACTCCCGTCGGAAAGGAAGATGTGGCAAAACTTGTCGATGACATATACAGAAAGATGCAGGTTTCTGGAGCCACTGAATTGGCATCAAAGGGACAGGGTGTGTCGGACAGCAGTAATGTAAAAGTTAAAGGGGAAATAGATGATCCCTTTCAATCAGATATGAAAGTTCGTTGTCCCTGTGGAAGTTCATTGGAGACTGAGAACATTATTAAGTGTGAAGGTCCAAGATGTCAAGTGTGGCAACATATTCGTTGTGTTATAATTCCAGAGAAGACTATGGAGGGGAATCCACCAGTTCCTGATTTATTTTATTGTGAAATTTGTCGGCTGAGCCAAGCTGATCCTTTTTGGATTACTATTGCGCACCCTTTATGTCCGTTGAAGTTGGCTGTTTCAAATATCCCAAATGATGGTACAAATCCAGTCCTAAGTGCAGAGAAAACATTCCAAATCACCAGGACAGACAAGGACTTACTGACAAAACAAGAGTATGATGTCCAGGCATGGTGCATGCTTCTGAATGACAAAGTTCCATTCAGGATGCAGTGGCCTCAATATGCAGATTTGCAGGTTAATGGCTTACCTGTTCGTGCTATTAATAGACCTGGCTCGCAATTGCTCGGGGCTAATGGCCGTGATGATGGTCCAATTATAACTCCATGTACTAAAGATGGAATTAATAAGATAACTTTGACTGGATGTGATGCTCGCGTATTCTGTTTTGGAGTTAGAATTGTTAAACGGCGAACTGTTCAACAGGTACTTAACATGATTCCCAAGGAGACTGATGGTGAACGTTTTGAAGACGCTCTTGCTCGTGTTTGTCGTTGTGTTGGTGGTGGAACTGCAACAGACAATGGTGATAGTGACAGTGATCTAGAAGTTGTTGCAGATTTTTTTGGAGTCAACCTACGTTGCCCTATGAGTGGTTCAAGAATGAAGGTGGCTGGAAGGTTTAAACCTTGTGTTCACATGGGTTGTTTTGATCTTGAAGTTTTTGTGGAGCTGAACCAACGTTCTAGGAAGTGGCAGTGCCCAATTTGTCTGAAGAACTACTCATTGGAGAACATTATAATTGATCCTTATTTTAATCGCATCACATCCAAGATGAGAAATTGTGGAGAAGATATTACCGAGATTGAGGTCAAGCCTGATGGTTCTTGGCGTGCAAAGGCTAAAAGTGAAAATGAGCGTAGAGAACTTGGTGATCTTGCACAATGGCATTCTCCTGATGGTACTCTATGTGTCCCTGGCAGTGCGGAGGTTAAGCCCAGAGCTGAAACGTCGAAGCAGATCAAACTTGAAGGTGCTTCAGATGGTCATACAGGTTTGAAACTTGGAATCAAGAAGAATAGCGATGGGTTGTGGGAAGTTAGCAAGCCTGAAGATATGAACACGTCTTCTGATAGTAGATTACAAGAAAGATTTGAACATCATGAGCAAAAAATTATTCCAATGAGCAGCAGTGCAACTGGAAGTGTTAAGGATGGTGAAGATCCTAGTGTAAATCAGGATGGTGGTGGGACTTACGACTTTACAAGCAATGGGATTGAACTTGATTCCATGCCTCTGAACATAGATTCGGCATATGAATTTACGGACCGAAATCCATCTGCACCCACAGGAAATGCAGAAGTTATTGTTCTTAGTGATTCAGATGAAGAGAATGACATATTGATATCTTCTGCAACTCTTTATAAGGATAATCAAAATGATTCTTCTGGACTTAATTTTCCAGTGGCTCCTCCTGGAATTTCTCATCCATATTCAGAAGATCCAGCTCTTGGGCCTGCTGGTAATTTGGGTCTTTTTCCTACTAATGAT
GAATTTGACATGGGTCTGTGGTCATTACCTCCAGGACCCCCAGAAGGCTCTGGGTTTCAACTATTTAGTACAAATGCAGATGTCTCAGATGCCTTAGTTGATCTGCAGCGTAATGCTCTCAATTGCCCTCAATCAATGAATGGTTATACATTGGCTCCAGAGACAACAATGGGATCTGCAAATCTAGTCCCTGGTTCTTCCATTGGTCAGACTGATACAGATATAAATGATCGCTTAGTTGACAATCCCTTGTTTGGTGCAGAAGATCCTTCTCTTCAGATATTTCTTCCAACTCGTCCTTCAGATGCATCAGCACAGTCTGATTTGAGAGATCAAGCTGATGTATCAAATGGCATCCGTACTGATGATTGGATTTCACTTAGGCTTGGAGATGGTGCAACTGGTGGTCATGGGGACTCAACAACAGTGAATGGATTGAATTTAAGGCAGCAGATACCTTCCAGAGAACGCACTATGGATTCTTTGGATGACACCGATTAG |
Protein: MDLVASCKDKLAYFRIKELKDVLTQLGLSKQGKKQDLVERILGALSDEQVAKMWAKRTPVGKEDVAKLVDDIYRKMQVSGATELASKGQGVSDSSNVKVKGEIDDPFQSDMKVRCPCGSSLETENIIKCEGPRCQVWQHIRCVIIPEKTMEGNPPVPDLFYCEICRLSQADPFWITIAHPLCPLKLAVSNIPNDGTNPVLSAEKTFQITRTDKDLLTKQEYDVQAWCMLLNDKVPFRMQWPQYADLQVNGLPVRAINRPGSQLLGANGRDDGPIITPCTKDGINKITLTGCDARVFCFGVRIVKRRTVQQVLNMIPKETDGERFEDALARVCRCVGGGTATDNGDSDSDLEVVADFFGVNLRCPMSGSRMKVAGRFKPCVHMGCFDLEVFVELNQRSRKWQCPICLKNYSLENIIIDPYFNRITSKMRNCGEDITEIEVKPDGSWRAKAKSENERRELGDLAQWHSPDGTLCVPGSAEVKPRAETSKQIKLEGASDGHTGLKLGIKKNSDGLWEVSKPEDMNTSSDSRLQERFEHHEQKIIPMSSSATGSVKDGEDPSVNQDGGGTYDFTSNGIELDSMPLNIDSAYEFTDRNPSAPTGNAEVIVLSDSDEENDILISSATLYKDNQNDSSGLNFPVAPPGISHPYSEDPALGPAGNLGLFPTNDEFDMGLWSLPPGPPEGSGFQLFSTNADVSDALVDLQRNALNCPQSMNGYTLAPETTMGSANLVPGSSIGQTDTDINDRLVDNPLFGAEDPSLQIFLPTRPSDASAQSDLRDQADVSNGIRTDDWISLRLGDGATGGHGDSTTVNGLNLRQQIPSRERTMDSLDDTD |
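
The Location and Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Location value follows the usual GenBank-style feature location syntax (1-based, inclusive `start..end` segments wrapped in `join(...)`, with `complement(...)` marking the reverse strand). The helper names `parse_location` and `spliced_length` are hypothetical and are not part of the record or of any library.

```python
import re

# Location string copied verbatim from the record's "Location" field.
LOCATION = (
    "complement(join(32007138..32007142,32007358..32008570,"
    "32008645..32008725,32008806..32008910,32009720..32009881,"
    "32011097..32011204,32011441..32011514,32011788..32011863,"
    "32011954..32012041,32012196..32012264,32013200..32013330,"
    "32014288..32014450,32014708..32014780,32015458..32015500,"
    "32015824..32015904,32016539..32016562))"
)


def parse_location(location):
    """Return (strand, segments) for a GenBank-style location string.

    Hypothetical helper: handles only plain start..end segments,
    optionally wrapped in join(...) and complement(...).
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [
        (int(start), int(end))
        for start, end in re.findall(r"(\d+)\.\.(\d+)", location)
    ]
    return strand, segments


def spliced_length(segments):
    """Sum segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
cds_nt = spliced_length(segments)

# Assumption: the CDS ends with one stop codon that is not translated.
protein_aa = cds_nt // 3 - 1

print(strand, len(segments), cds_nt, protein_aa)
# prints: - 16 2496 831
```

Under these assumptions the 16 segments on the reverse strand add up to 2496 nt, that is 832 codons including the terminal TAG, which agrees with the stated protein length of 831 aa.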